Implementation of the DWT using Intel IA-32 SIMD Extensions
نویسندگان
چکیده
This paper presents some results on the implementation of the DWT (DWT) trough the lifting scheme by using general purpose processor SIMD extensions. We perform image analysis and reconstruction up to 3 levels of decomposition, using the DWT factored into lifting steps for the 9/7 wavelet filter pair. The algorithm was implemented in “C” code and evaluated in terms of performance and image degradation. Three approaches were used: floating-point representation, integer fixed-point representation and SIMD extensions integer code. The results obtained when compared to floating-point code implementation, indicate that the processing time for fixed-point is around 54% and SIMD extensions code is around 24.2%. The average PSNR results are also better for fixed-point and SIMD extensions than with floating-point code implementation. Key-Words: DWT, Lifting Scheme, SIMD Extension, Intel IA-32 GPP Architecture
منابع مشابه
A Portable Short Vector Version of Fftw
This paper presents a portable short vector extension for the popular FFT library Fftw. Fftw is a freely available portable FFT software-library that achieves top performance across a large number of platforms. The newly developed extension enables the utilization of short vector extensions like Intel’s SSE and SSE 2 as well as Motorola’s AltiVec for any problem sizes. The method is independent...
متن کاملIA-32 Execution Layer: a two-phase dynamic translator designed to support IA-32 applications on Itanium-based systems
IA-32 Execution Layer (IA-32 EL) is a new technology that executes IA-32 applications on Intel® Itanium® processor family systems. Currently, support for IA-32 applications on Itanium-based platforms is achieved using hardware circuitry on the Itanium processors. This capability will be enhanced with IA-32 EL—software that will ship with Itanium-based operating systems and will convert IA-32 in...
متن کاملPentium III Processor Implementation Tradeoffs
This paper discusses the implementation tradeoffs of the Pentium III processor. The Pentium III processor implements a new extension of the IA-32 instruction set called the Internet Streaming Single-Instruction, MultipleData (SIMD) Extensions (Internet SSE). The processor is based on the Pentium Pro processor microarchitecture. The initial development goals for the Pentium III processor were ...
متن کاملVectorization of the 2D Wavelet Lifting Transform Using SIMD Extensions
This paper addresses the vectorization of the lifting-based wavelet transform on general-purpose microprocessors in the context of JPEG2000. Since SIMD exploitation strongly depends on an efficient memory hierarchy usage, this research is based on previous work about cacheconscious DWT implementations [1,2,3]. The experimental platform on which we have chosen to study the benefits of the SIMD e...
متن کاملAn Implementation of Parallel 1-D FFT Using SSE3 Instructions on Dual-Core Processors
In the present paper, an implementation of a parallel one-dimensional fast Fourier transform (FFT) using Streaming SIMD Extensions 3 (SSE3) instructions on dual-core processors is proposed. Combination of vectorization and the block six-step FFT algorithm is shown to effectively improve performance. The performance results for one-dimensional FFTs on dual-core Intel Xeon processors are reported...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2008